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ABSTRACT 

We develop a general theory for estimating the probability that a galaxy cluster of a given shape 
exists. The theory is based on the observed result that the distribution of galaxies is very close to 
quasi-equilibrium, in both its linear and nonlinear regimes. This places constraints on the spatial 
configuration of a cluster of galaxies in quasi-equilibrium. In particular, we show that that a cluster 
of galaxies may be described as a collection of nearly virialized subclusters of approximately the same 
mass. Clusters that contain more than 10 subclusters are very likely to be completely virialized. 
Using our theory, we develop a method for comparing probabilities of different spatial configurations 
of subclusters. As an illustrative example, we show that a cluster of galaxies arranged in a line is 
more likely to occur than a cluster of galaxies arranged in a ring. 

Subject headings: cosmology: theory — galaxies: clusters: general — gravitation — large-scale struc- 
ture of universe — methods: analytical — methods: statistical 



1. INTRODUCTION 

Clusters and groups of galaxies are commonly studied 
structures in the universe, and have been defined using 
various criteria; groups are smaller clusters. These clus- 
ters may contain over a thousand member galaxies and 
occupy a few cubic megaparsecs. Although clusters can 



be identified by the ir density and size (e.g. Abell 1958 



Herzog et al.||1957) their shapes and structures ditter 
I'or example, Herzog et al. (1957) identify three classes 
of clusters: compact clusters which have a single nearly 
spherical dense concentration of galaxies, medium com,- 
pact clusters which are less dense and may have multiple 
concentrations of galaxies and loose clusters which do 
not have any outstanding concentrations of galaxies. 

Even these simple classes of clusters suggest greater 
complexity than just s pherical concentratio ns in a region 
of space. 



For example, 
icant substructure in t 



Binggeli et al.| ( |1987 [) found signif- 
le core of the 



Virgo cluster as well 
as pro nounced double structure, although [Herzog et al.| 
(1957) classify the Virgo cluster as a medium compact 
cluster. The structure of the Virgo cluster, as the near- 
est large cluster, suggests that these irregular shapes are 
common. 

Such irregular shapes may result from mergers. 
Smaller groups fall into the central region of a cluster and 
form subgroups whose member galaxies are still tightly 
bound to each other. Irregular shapes resulting from sub- 
groups then disappear as a cluster virializes. However, 
many clusters have dynamical relaxation timescales on 
the order of a Hubble time, and their incomplete viri- 
alization suggests that irregular clusters with multiple 
concentrations of galaxies should be common in the uni- 
verse. 

The basic dynamical description of a cluster is its 6- 
dimensional phase space configuration such as a sphere 
with a density and velocity profile. More detailed de- 



scriptions of clustering include correlation functions, per- 
colation trees and counts-in-cells statistics. In particu- 
lar, the counts-in-cells description is especially suitable 
for this problem because it straightforwardly analyzes re- 
gions of space (cells) with a specified size and shape. In 
addition, the physics of this descript ion can be derived 
from gravitational thermodynamics (Saslaw & Hamilton! 



1984) or statistical mechanics (Ahmad ct aL2002) where 
the basic particles of the system are galaxies that interact 
in a grand canonical ensemble of cells. 

This physical description leads to the gravitational 
quasi-equilibrium distribution (GQED) for the counts-in- 
cells distribution of galaxies. The G QED has been sh own 
to agree with A^-body simulations ( |Itoh et aL 1988| and 
subsequent work) and various analyses ot sky surveys in 
both the linear and nonlinear regimes of clustering (e.g. 
ISivakott fc S'aslawl [20051 |Yang fc Saslaw] [2011] and ref- 
erences therein). This indicates that the statistical me- 
chanical theory is a suitable description of clustering. 

The theory was subsequen tly extended to descri be par- 
ticles with different masses (]Ahmad et al. 2006a), three 



body interactions ( Ahmad et al.|20 06b ) and ditterent in- 
ternal structures (IVa ng et al.]|2(Jlip . it was also further 
generalized to takeTihto account higher orders in the se- 
ries expansion of the gravitation al interaction d Saslaw fc 



Ahmad]]2010| , and extended by ]Leong fc Sasl aw (200-^ 
to describe tne potential and kinetic energies in a single 
cell, and hence the probability that a cell with N galaxies 

would be virialized. 

In t his paper, we extend the work by jLeong fc Saslaw 
(12004) and relate the ratio of potential energy to kinetic 
energy in a cell to its detailed configuration. Here, we 
work with cells rather than clusters because cells are well- 
defined regions of space and, unlike clusters, they have 
a clear boundary. Another advantage of working with 
cells is that we can use larger cells to study how clusters 



of galaxies cluster around each other, and hence provide 
insights into the process of hierarchical structure forma- 
tion. 

This paper is structured as follows: In section [2] we 
rescale the thermodynamic variables to describe the 
properties of a self-gravitating system in a form that is 
independent of physical units. This provides a scale-free 
description of particles in a cell. These may be galaxies 
in a group, or substructures in a cluster, or even clusters 
in a large cell for insights into hierarchical clustering. In 
section p] we use the scaled thermodynamic variables to 
determine the probability of finding a cell with a given 
kinetic energy or virial ratio in quasi-equilibrium. This 
lets us determine the conditions that produce a virialized 
or unvirialized cell, and its implications for substructure 
in these cells. In section |4] we derive a relation between 
the configuration, energy and probability of a cell. In 
section [5] we describe a procedure for studying the in- 
ternal structure of a cell and apply this procedure to an 
illustrative example. Finally in section l6J we summarize 
our results. 

2. SCALED ENERGIES 

We consider spatial cells with a fixed volume and 
shape, containing TV gravitating "particles" which for 
simplicity, have the same mass m. We show in Appendix 
[A| that if "particles" have different masses, the "parti- 
cles" with masses within an order of magnitude of the 
most massive "particle" will dominate the total poten- 
tial energy. 

For these conditions, the scale of a cell and the mass 
of a "particle" are free parameters, and the statistical- 
mechanical description may apply to very different 
regimes. "Particles" may be individual galaxies in a 
small cell, tightly bound groups in a larger cell, or even 
clusters of galaxies in very large cells. Therefore we ex- 
press t he energy of a cell in dimensionless form (Leong 



& Saslaw 2004) where we write the energy in terms o± a 
scaling factor based on the mass of a "particle" and the 
size of a cell. This lets us focus on the physical properties 
of clustering and ignore the scale dependent effects. 

Because the potential energy of a uniform density cell 
scales with the mass of a particle and the cell's radius, we 
use the average potential energy of a particle as a scaling 
factor. Then the temperature T from the kinetic energy 
K, the correlation potential energy W and total energy 
E — W + K are expressed in terms of scaled dimension- 
less quantities (with Boltzmann's constant unity) 



and 



E^ = 



2K 

3iVA' 

'' iNA 

2E 

'iNA 



(1) 
(2) 

(3) 



where A is a scaling factor with dimensions of energy: 



3 NGm^ / e 



(4) 



and C,{e/Ri) represents the extent of a particle and its 
halo. Here i?i is the radius over which the configuration 



integral is taken, which is the distance where the expan- 
sion of th e universe cancels out the smoothed background 
potential ( Saslaw & Fang 1996 ) . This is generally the dis- 
tance at which the two-point correlation function ^2(?') 
becomes negligible. 

The scaling factor A is proportional to the absolute 
value of the average potential energy of a particle, and 
comes from the potential energy con tribution to the par- 
tition function (c.f. equation (14) of Ahmad et al.|[2002 1 



Q2{T,V) = 



1 



Gto2 

Tri2 



K{r,e) 



d^nd^rs. (5) 



Here G is the gravitational constant, m is the mass of a 
particle. The «:(e,r) and ({e/Ri) factors are from mod- 
ifications of the usual point rnass potential for ex tended 
particles | Ahmad et al.)|2002[ [Yang et al.]l2011[ ). The 
extended length scale e is typically smaller man Ri . In- 
tegration by parts gives a relation between C and k: 



c 






2 



Ri 



rK(r, e)dr. 



(6) 



In the case of point masses, k = \ and C = 1- Except in 
extreme cases where e/i?i i s very large, C is ge nerally of 
order unity (c.f. figure 1 of Ahmad et al. 2002 ). 

The scaled energies given by equations ([l]) and ^ are 
essentially ratios of kinetic energy to potential energy, 
and total energy to potential energy respectively, while 
the scaled potential energy given by pj) describes the 
"compactness" of the spatial configuration. These ener- 
gies are an instantaneous snapshot of the positions and 
velocities of galaxies in a cell. 

Although these cells are not rigorously in equilibrium, 
they are in quasi-equilibrium. This means that the in- 
stantaneous values of the potential and kinetic energies 
for a single cell will fluctuate about the current aver- 
age for an ensemble. On a larger scale, the energies 
and thermodynamic quantities averaged over an ensem- 
ble of cells in quasi-equilibrium change slowly compared 
to the dynamical timescale within a cell, so intermediate 
time averages of the ensemble are stable. These fluc- 
tuations mean that the instantaneous energies are more 
more closely related to the local dynamics of a cell than 
to the thermodynamics of the ensemble. Nevertheless, 
since the instantaneous energies are distributed about 
their time-averaged values, they allow us to relate an ob- 
served cell to its thermodynamic quantities. 

In quasi-equilibrium, the thermodynamic quantities 
are averages over space and time, and are more closely 
related to the time-averaged energies of a cell so we use 
these averages as a basis for our theory. For a cell in 
quasi-equi librium, the equatio n of state for a canonical 



ensemble ( Ahmad et al 
energy of a cell U to its 



2002) relates the average total 
kinetic temperature T and the 



clustering parameter h = —W/2K through 
U='-—{l~2h) 



(7) 



for units of temperature where the Boltzmann constant 
is 1. 

This means that we can use ensembles with different 
average energies as an approximation for individual cells 
with different energies and take E k, U . Under this ap- 



proximation, the quasi-equilibrium energies are approx- 
imately equal to the time-averaged energies. For such 



ensembles, the c lustering parameter b is given by (Ah 
mad et al.|[2002l ) 

/3nT'\{e/Ri) 
l + PnT'^Cie/Ri) 



W 
'2K 



(8) 



where /3 = {?>/2){Gm?Y ^-^id T is the average kinetic 
temperature of the canonical ensemble. 

To relate the scaled energies to the definition of b in 
equation (|8| and to use th e ther modynamic quantities 
described m Ahmad et al. (12002} for further analysis, 



^/■^ and make the scale transformati' 



(9) 



we use Ri a n 

following [Ahmad et ^^ ( |2002[ ) 

^ ^ {Gm^f{R,Ty 

using th e scaling property of the partition function de- 
rived by Landau & Lifshitz (1980 page 93). This trans- 
formation avoids fractional powers in n and simplifies the 
derivation of the thermodynamic quantities. Under this 
transformation we have a different scaling factor a given 
by 



a= -{Gm^ffiC 



Ri 



= (5n(; 



Ri 



(10) 



and thus define a different set o f dimensionless parame- 
ters (c.f. [Leong fc Saslaw||2004| ) 

- T 2K 

T,= ^^ = -.TTT-^^ (11) 



and 



ai/3 3iVai/3 
2E 



Using E ^ U and equations ([7|, ([s]) and (10) we relate 
E^, to T* by 



_ 2r(i - 2b) _^t:-i 



37Vai/3 



which also gives 



W, 



2W 



2{E - T) 
3iVai/3 ~ 3iVai/3 



T^ + l 



2T, 



T 



(13) 



(14) 



These quantities, derived from the statistical mechanical 
theory of galaxies interacting in quasi-equilibrium, ap- 
ply to cells in quasi-equilibrium and represent the time- 
averaged local energies of a cell taken over the fluctua- 
tion timescale of the cell. In contrast, the quantities T^,, 
Wi, and El,, are a rescaling of the energies that are as- 
sociated with a specific phase space configuration, and 
describe instantaneous values of the energies in a cell. 

At this point, we emphasize that the scaled energies 
T*, W^ and E^, and correspondingly T*, W^, and £'*, 
are local to a cell and do not represent the average of the 
grand canonical ensemble. In fact, for a system in quasi- 
equilibrium, the fiuctuations in potential energy within 
a cell are prop ortional to the local kinetic energy fluctu- 
ations so that ( Saslaw et al. [199(5 Leong fc Saslaw|2004[ ) 



GmN 



(;) 



.{v^) 



(15) 



where (f ^) is the mean square velocity of galaxies and 
a is a local form factor that determines the kinetic and 
potential energy fluctuations. Generally a will vary from 
volume to volume, and comparing equations M and ( [l5| 

T, oc - (16) 

a 

which means that T^ is a local quantity that will vary 

from cell to cell. 

To relate E^, to -E*, we define the time-averaged lo- 
cal virial ratio ip and its instantaneous counterpart ip as 
the ratio of the absolute value of the local correlation 
potential energy to twice the kinetic energy of galaxies 
contained within a cell. Hence we get 



V--- 



W 
2K 



2T, 



(17) 



which may fiuc tuat c abo ut its t ime - averaged value. Us- 
ing equations (14), (111 and (10), the corresponding 
time-averaged value is 



^ 



W, _ 1 
2T, ^ tI + i' 
(3nT-^Cie/Ri) 



T. 



^3" 



l + /3nr-3C(e/i?i) 



(18) 



local 



which has a range of < '0 < 1. Although the form 
of equation (flSl) is similar to the form of equation (Isl), 



ip and ip are local quantities that describe the internal 
virial ratio of a cell whereas b is an average over the 
entire ensemble. Therefore ijj will fiuctuatc between cells 
and is distributed with an average value given by b. 

With the virial ratio, the scaling factors cancel out 
and the actual observed virial ratio ip fluctuates about 
its time-averaged value jjj. Hence the similarity be- 
tween equation ( 17 ) and ( 18 1 indicates that we can deter- 
mine the probability of finding a cell with a given time- 
averaged energy level, and relate this probability to an 
actual cluster whose time-averaged energies cannot be 
observationally determined. 

2.1. Physically important energies 

With the various representations for the energy of a 
cell, we next examine physically significant values of the 
time-averaged scaled energies E^, T^,, W^. and V and 
relations among them. Equation ( 13 ) indicates that 



T.^,[E^,] is double valued for E^, < with a minimum 
at T* = (VW - 3)^/^ » 0.54, and is single valued for 
T* > 1. Leong & Saslaw (2004) give a detailed discus- 



sion of the different regimes of the T^,[E^,] relation, so 
we focus on the physically relevant cases here. We plot 
some of the relationships among these quantities in figure 
[T] and indicate some interesting values. 

At El. > 0, most galaxies in the cell are unbound and 
the average speed of these galaxies is greater than the 
escape speed. This corresponds to tp < 0.5 and describes 
the case where clusters are unlikely to form in the cell. 
Galaxies are mostly bound in clusters when E^, < 0, 
corresponding to T^ < 1, tp > 0.5 and —W^ > T*. At 
El. = 0, ip = 0.5, —W^ = T* and the kinetic energy is 
equal to the absolute value of the potential energy. 
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Figure 1. (a): The ensemble average normalized energy Et, as a function of the ensemble average normalized temperature T*. The 
vertical dotted line separates the region where T* < 0.54, corresponding to the negative specific heat branch of _E*[T*], from the region 
where T* > 0.54, corresponding to the positive specific heat branch of Et[Tt]- To the right of the dashed lines, Et > and most of the 
galaxies in the cell are unbound, (b): The ensemble average normalized virial ratio '0, as a function of the ensemble average normalized 
temperature T,. This has the range < ip < 1. In comparison, T« may become very large, (c): The ensemble average normalized energy 
Et, as a function of the ensemble average normalized virial ratio tp. E« is negative for i/i > 0.5 and has a minimum at ijj = (vTO— 2)~^ Ri 0.86 
which is marked by the vertical dashed line. 



The state where T^, — has no average pecuUar ki- 
netic energy. This occurs only when th e entire universe 
has c ollapsed into a single supercluster (Baumann et al. 
20031. For an individual cell, this state is unphysical and 
hence not in quasi-equilibrium, which suggests that there 
is a minimum value of T* > with a corresponding value 
ofE*^ 

At T, = (%/lO- 3)1/3 « 0.54, E^T^] is a minimum. 
This corresponds to V' = (2 + VW)/6 sa 0.86. We can 
describe the physical characteristics of this point using 
the specific heat of the cell, given by 



Cv^ 



1 dE 
Ndf 



3 dE, 
2W\ 



2^ 



6r;- 1 



(19) 



T* = 0.54 is a zero of Cy and the specific heat of the 
cell is negative for T* < 0.54 or -0 > 0.86. Therefore a 
cell with T^, < 0.54 is a special case because it is suffi- 
ciently condensed that it is unlikely to ex change galaxies 
and e nergy with the rest of the universe ( Saslaw & Yang 
2010 1. Such cells can be approximated by closed systems 
that continue to relax independently. They form a set of 
microcanonical ensembles. Other cells, with T* > 0.54, 
may still exchange galaxies and energy with the rest of 
the universe and thus are part of a grand canonical en- 
semble. For this reason, we use a grand canonical ensem- 
ble for our analysis. 

Because the T^[E^] relation is double- valued with a 
transition at T* = 0.54, we describe the branches sepa- 
rately. We refer to the branch where T* < 0.54 as the 
negative specific heat branch since cells with T^, < 0.54 
have negative specific heat, and correspondingly the re- 
gion where T* > 0.54 is the positive specific heat branch. 

2.2. The limits ofT^ 

Because the theory described in this paper is valid 
for quasi-equilibrium, we examine the conditions needed 
for a region to be in quasi-equilibrium. The require- 
ment is that the macroscopic evolution of a region in 
quasi-equilibrium be slow compared with the crossing 
time of the system so that equilibrium prevails approxi- 
mately. Regions that are not in quasi-equilibrium are un- 
stable. They generally will restructure internally to reach 
a quasi-equilibrium state on a timescale that is shorter 
than the evolution of a quasi-equilibrium system. 

The condition that a system be in quasi-equilibrium 
places limits on T* and tp. These limits define the range 
for the quasi-equilibrium thermodynamic theory to ap- 
ply. Since we have established that T* = is an extreme 
and unlikely case, we seek physically motivated limits on 
T*. We denote these limits by T^min and r*.,„ax with 
corresponding values for ip. 

In the high temperature limit with large T* , the kinetic 
energy dominates over the potential energy so the system 
can be described essentially as an ideal (non-relativistic) 
gas. This means that systems with large T* are physi- 
cally possible and can be described simply by the well- 
studied case of an ideal gas. Therefore T^^max = oo and 
V'min = ^OT the case of an ideal gas. 

In the low temperature limit, T* = is an extremely 
unlikely case (occurring only in static cosmologies) so the 



quasi-equilibrium lower bound on T* is nonzero. Con- 
straints on r*,,„in thus have to come from the conditions 
for quasi-equilibrium for cells on the negative specific 
heat branch of the E^,[T^,] relation. 

For regions to have negative specific heat, they must be 
a microcanonical ensemble that has separated out from 
the grand canonical ensemble that describes the rest of 
the universe. Because such regions are closed systems, 
they generally contain clusters that are so tightly bound 
such that they do not exchange energy or particles with 
the rest of the universe. This also means that the energy 
of an individual cluster remains nearly the same as when 
it first separated out of the grand canonical ensemble. 
Therefore the quasi-equilibrium grand canonical ensem- 
ble can provide initial conditions for the formation of a 
set of microcanonical ensembles. 

To determine the co nditions for quasi - equili brium, we 
use equation (3.24) of Leong & Saslaw (2004) to relate 



T* to the mean squared velocity of galaxies in a cell (w ), 
the dynamical timescale of the cell Tdyn ~ iGp)~^/^ and 
the average nearest neighbor separation between galaxies 

(r) -- n~i/3 by 



T,= 



n^/^iv^) 



0.0881 (v^)T|y„ 



3(3/2)i/3^i/3Gp (1/3 {ry 



(20) 



In approximate virial equilibrium, (r)^/(u^) w '''dyn' 
which sets the lower bound for T^,. For C ~ 1: this sug- 
gests the minimum value of T* is T^min w 0.1, corre- 
sponding to -E*,min ~ —0.1 and V'max > 0.999. With 
these limits we can now determine the probability that a 
cell has a given energy. To do so, we use a grand canon- 
ical ensemble of cells since cells are essentially open re- 
gions of space that are not enclosed by an insulating wall 
and hence may exchange particles and energy. 

3. PROBABILITIES 

The probability that a cell with N galaxies in quasi- 
equilibrium in a grand canonical ensemble has total en- 
ergy E is gi ven by the usual result from statistical me- 
chanics (e.g. Leong fc Saslawp004 ) 



-E/To NtJ./To 

P{E,N)dE^g{E) dE 



(21) 



where g{E) is the density of states having energy E, and 
To, ^ and Zq are the temperature, chemical potential 
and partition function of the grand canonical ensemble. 
Here we use units of temperature where the Boltzmann 
constant is 1 so temperature has energy units. 

To simplify the analysis, we can separate E and N and 
express the probability in equation (21 1 as 



PN{E)dE = fv{N)P{E\N)dE 



fv{N)g{E)- 



,-B/To JV^/To 



-dE 



(22) 



where P{E\N)dE is the conditional probability that a 
cell has energy E given that it has N galaxi es, and 
/v(7V ) is the counts-in-cells distribution given by Ahmad 
et al. ( |2002 ). This allows us to work with the subset of 
cells with the same value of N, which projects the grand 
canonical ensemble into a canonical ensemble. 



6 



Using the scaled thermodynamic vari ables we deter- 
mine the various factors in equation (21 ). From statisti- 
cal mechanics, the density of states g(E) for an ensemble 



9iE) 



dE 



d{e^ 
dE 



(23) 



where il is the number of microstates and S is the en- 
tropy of the ensemble. For a canonic al ensemble in quasi- 
equilibrium, the entropy is given by (Ahmad et al.|2002| 



iVln 

5N 



N 



YT3/2 



A^ln(l 



t;') 



3NT, 



1 + T 



-3 



3iV 



, /27rm\ 



(24) 



where T is the unsealed kinetic temperature of a cell in 
units where the Boltzmann constant is 1, T^ is the volume 
of a cell, N is the number of galaxies in the cell, m is the 
mass of a ga laxy and A is a normalizing factor. Since 
equation (12 1 relates E^, to T* we can write the number 



of micr ostates QjE^,) for scale d energy E^, in terms of T* 
as (c.f. |Leong fc Sa5aw|[2004l ) 



fl[E^, [T^,] 



V 

N 



l + T. 



N / . 3W 

/ 2TTmT \ ^ 



V A2 ; 






T\Yeng{E^) is 



(25) 



9{E,[T*]) 



dn dT^ 



dT^, dE 
3iV 



> 



(l+T-)l" 



/27rmT\ ' ^- 



\ A2 



3iV 
1 + T^ 



(26) 



The other quan tities in equation (21 ) arc found in Ah- 
mad et al. (2002). For the grand canonical ensemble, the 
fugacity and partition function are 



Nil 

e'^0 — 



V 



,t; 



N 



N -Nb 



and 



{l-hre 



Zg = exp [7V(1 



2TTm 



(27) 



(28) 



where b as in equation ([8| is the clustering parameter for 
the grand canonical ensemble, N is the average number 
of galaxies in a cell, and Tq is the temperature of the 
gran d canonical ensemble. Substituting equations ( 26 ) , 
(27) and (28) into equation (21), the terms involving A 
and m cancel and we get 



P{E4T,]\N)dE, 



3N N 



T 



3/2^ 



N 



{i + T,y'{i-b) 



\N 



X exp 



5N 



^-Nb-N{l-b)-^ 



l + T. 



To 



dE^. 
(29) 



To write equation ( 29 ) in terms of T* we solve for T/Tq 
and E/Tq. EquationsflOl) and M give 



Tn = 



l-b 



PnCie/Ri 



1/3 


'l-b ' 


[ b 1 



1/3 



Using equation (11) we get 



T 



T, 



1 



1/3 



and from equation (12 1 we get 



E 
To 



2 *-3 



T 



l-b 



1/3 



(30) 



(31) 



(32) 



Substituting equations (31 1 and (32) into equation (29), 
we get 



P(E4T,]\N)dE, 



(l + T^fil-b) 




N 



(33) 



which gives the differential conditional probability that 
a cell has scaled energy E^, given that it has N galaxies. 

3.1. Change of Variables and Normalization of 

P(E4T,]\N) 

The probability that a cell has a scaled energy in the 
range -E*,! < E^, < E^,,2 comes from integrating over the 
relevant range so that 

Pn{E,.i <E,< E,^2) = / *' P{E,^,]\N)dE, (34) 

which is normalized by integrating over all possible values 
ofE* 



P(E^[T^]\N)dE^ 



(35) 



Because E^\T^,] has a double valued regime for E^ < 
0, the integrals in equations (34) and (351 are taken by 
integrating over both the positive and negative specific 
heat branches. We can split the integral such that 



£..: 



P{E4T,]\N)dE^ 



P-{E,\T,]\N)dE,+ _ P+{E,\T,]\N)dE, 

(36) 

where P- and P+ denote probabilities for the negative 
and positive specific heat branches of T*[i!^,] respectively. 



These ranges are also subject to the quasi-equiUbrium 
hmits such that f * > 0.1 and E^ > -0.390. 

To simphfy the analysis, we can rewrite the probability 
in terms of T* and ip. The change of variables to rewrite 
the probability in terms of T^ is 



P(T,\N)dT, ^P{E4T,]\N) 



dE^ 



dT^ 



dTt, 



where we take the absolute value of the Jacobian 



dE^ 
dT^, 



T, + 6r, - 1 

{i + Tly 



(37) 



(38) 



because it is negative in the negative specific heat branch 
where T* < 0.54. The probability is therefore 



N 



P{T^\N)dn =N e 



-N(l-b) 



t: + err - 1 



{i + ry 



37V 

2fr 



— 3 

^(1 + T:')(1-6) 



N 



X exp 



(39) 



N{l-b) + 



3NT^-l 



3 



Tib 
1-6 



1/3N 



T. 



dTf, 



1 



where we have factored out terms containing N . These 
terms will cancel out when the probability is normalized 
because they do not depend on T* . The integral in equa- 
tion ( 36 1 becomes 



Pn{T.,i <T,< T,;2) 



P{T,\N)dT, (40) 



T. 



which includes both the positive and negative specific 
heat branches. The normalization factor follows as 



P{T.,\N)dT., 



(41) 



which is simpler to evaluate because T^ is single-valued 
throughout the domain of in tegration. 

Similarly to equation (37), we write the probability in 
terms of ip as 



P{'4>\N)dtlj ^ P{T,\N) 
^P(E,\N) 



dT, 
dE, 


dip 
dT, 


dT, 


# 



(42) 



di; 



where we take the absolute value of the Jacobian 

dE^ _ 1 + 0- Q^^ 

# 3(l-V^)2/3^'*^^ 



(43) 



because it is negative f or t he negative specific heat 
branch. Using equation (171 we write the probability 



in terms of ^ to get 

-N 



P{iP\N)di:^N e-^^^-''^ 



l + Aip-6ip 



3(1 - V)2/3V'' 



4/3 



^ 



X exp 



1/3 



I / (1 ~b)b_ 



3N 

N 



(44) 



^(l-^') + ^(l-2^) 



X 1 



il-ij)b 



V.(i - b) 



1/3N 



dV- 



The probability that a cell has a virial ratio in the range 

V'l ^ ^ ^ "02 is thus 

PNi^i < ^ < ^2) = / ' P(iP\N)d^ (45) 



which is normalized by 



P{'iP\N)dip. 



(46) 



'Amin 
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and 



Here the T* and ■0 representations (equations 
44 ) of the probability essentially describe the same" sys 
tem, but provide insights into different physical proper 
ties. Using T*^,ni„ ~ 0.1 and T.^,ala.x = 00 as discussed 
in the previous section, we normalize and plot P{T^\N) 
and P(V'|7V) in figured 

Alt hou gh the unnormalized probabilities given in equa- 
tions ( 39 ) and ( 44 1 contain factors involving N, these fac- 
tors of N are independent of T* and iJj and thus cancel 
out in the normalization. Th erefor e w ith the normaliza- 
tion given by equations ([4T|) and ( 46 1 , the probabilities 



in equations (W0| and ( |45| are independent of N. 

From figure [zj the probabilities are mostly concen- 
trated in the negative specific heat branch, suggesting 
that galaxies in a cell are very likely to form a cluster 
that separates out into a microcanonical ensemble. This 
shows up especially in the case for A'^ = 15 where the nor- 
malized probabilities for the positive specific heat branch 
are on the order of 10^^. In general, denser cells with 
larger N are more likely to have bound and collapsing 
clusters because the more galaxies there are in a cell, the 
more likely they are to form a cluster. 

3.2. The probability of a bound cluster 

From the probability that a cell has a given energy, we 
can calculate the probability that its galaxies are bound, 
and the probability that it has a negative specific heat. In 
these cells, the absolute value of their internal potential 
energy is greater than their kinetic energy so that most 
galaxies do not have the escape speed. Thus E^ < 
which corresponds to T* < 1. The probability that a cell 
is bound is therefore 



P 



bound 



{N,b) 



J^^^^^^P{T,\N)dT, 
ff^ . P{T,\N)dT, 



(47) 
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Figure 2. Top row: P{Tt) for different values of A'^ and b plotted as a function of T». The left panel is for N = 5 and the right panel is 
for N = 15. Bottom row: P{'tl') for different values of N and b plotted as a function of -0. The left panel is for N = 5 and the right panel 
is for N = 15. All plots are independent of A'^ because N cancels out in the normalization. 



and the probability that a ceh has negative specific heat 
is 



PnegiN,b) 



/|° . P(T,\N)dT, 



(48) 



The probabihty that a cell has negative specific heat 
given that it is bound is similarly calculated from 



rO.54 „/?s 



f^-°" P{T,\N)dT, 
-„eg|bou„d(iV, b) = r- (49) 



We use r*^,„in = 0.1 as discussed in section 2.2 and plot 
these probabilities against N for different values of b in 
figure [3] For dense cells with N greater than about 10, 
the galaxies in a cell are likely to be bound in a cluster 
with negative specific heat. This suggests that such clus- 
ters may already have collapsed into a microcanonical 
ensemble and their internal energies are mostly inacces- 
sible to the rest of the universe. Such clusters are also 
more likely to be relaxed than clusters with fewer galax- 
ies because they are denser and have a faster relaxation 



timescale. Unlike Pncg and Pbound, 



ncglbound 



decreases 



as b increases. This is because as b increases, cells are 



relatively more likely to be bound than they are to have 
negative specific heat. 



4. ENERGY AND SHAPE 

Having determined the probability that a cell has a 
particular energy, the next step is to relate energy to 
shape. The most complete description of shape gives 
the detailed positions of all the galaxies in configuration 
space which involves 6iV independent variables. We can, 
however, take advantage of a number of properties of the 
galaxy distribution to simplify the problem. 

First, the mass dependence is contained in the factor 
(3 = {?>/2){Ginn?Y which scales to describe the quasi- 
equilibrium clustering of a system in which all objects 
have the same mass. This allows us to describe a hierar- 
chical model of clustering with different mass "particles" 
at each level of the hierarchy by rescaling /?. 

Next, because halos, galaxies and clusters of galax- 
ies that are mostly relaxed and have a virial ratio 
greater than about 0.86 have a negative specific heat, 
they approximately form a set of microcanonical ensem- 
bles. Hence their internal energies are largely inacces- 
sible to the rest of the universe. In particular, small 
dense groups are more likely to be virialized than larger 
clusters because their dynamical timescales, given by 
Tdyn oc {Gp)~^''^ are shorter. This allows us to describe 
virialized groups of galaxies as individual particles that 
can be treated as modified point masses. Finally, merg- 
ing particles that are bound to each other can be treated 

This 



2011) 



as a single extended particle (Yang et al. 

allows us to describe merging objects in tne statistical 

mechanical theory. 

These properties mean that we can represent many 
clusters of galaxies as groups of mostly virialized subclus- 
ters. The shapes of such clusters are thus given by the 
positions of their subclusters which reduces the many- 
body problem of describing all the galaxies in a cluster 
to a few-body problem that describes how subclusters 
interact with each other. Based on the probability that 
a cell is not completely bound and virialized as discussed 
in the previous section, we expect that most cells will 
usually contain fewer than about 10 of these subclusters, 
depending on the value of b. Some cells, however, can be 
dominated by a single virialized cluster. 



4.1. The Potential Energy of Subclusters 

Subclusters can be described by the positions of 
their individual galaxies with respect to the subcluster's 
center-of-mass and the positions of these centers-of-mass. 
To illustrate this, we use an example of a cluster com- 
posed of two subclusters. The total potential energy of 
such a cluster is given by the sum of the internal poten- 
tial energies of its subclusters and their mutual potential 



energies such that 
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(50) 

where the superscripts indicate particles from different 
subclusters. The first two terms are the internal potential 
energies of the subclusters, which are inaccessible to the 
rest of the ensemble if these subclusters are virialized. 
The last term represents the potential energy between 
the subclusters. We can write this term as 



$(1.2) 



^ I (1) 

i(l),j(2) Xj 



^ (1) (2) 



,(2) 



' E 



Gm] m\ 

'' J 



.(1,2) 



(51) 



-■(1,2) 



.(1) 



.(2) 



where r^^'^) jg ^j^g separation between the centers-of-mass 
of each subcluster. The detailed internal structure of 
the subclusters modifies their interaction potential. This 
modification is 



K{r 



(l,2)^ 



M(i)M(2) ^ 



(1) (2) (1 21 



.(1) 



.(2) 



(52) 



where M^^-' and Af (2) are the masses of subclusters 1 and 
2 respectively. Then the potential between subclusters is 

where K(r(^'2)) describes the modification to the point- 
mass potential by a pair of extended structure s. This 
modification term enters as a coefficient to /3 (Ahmad 
Yang et al. 20111 and if K(r(^'2)) is close to 



et al 

unity 



2002 

its effect will be small. 



We can write the positions of galaxies as the vector 
sum of the position of their subcluster's center-of-mass 
and their position within the subcluster: 



.(1) 



.(1) 



-i« 



(54) 

where x*^^^ is the center-of-mass of subcluster 1 and x^ ' 
is the position of particle i with respect to the center-of- 
mass of subcluster 1. With this notation, the modifica- 
tion term in equation (51 1 has the limits 



.(1,2) 



1 > 



,(2) 



> 



^(1,2) 



"(1,2) 



i(^) 



-(2) 



(55) 



In the limit where |Xj- — xj | <^ r(^'2) the modification 
term is 1. This means that we can approximate subclus- 
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Figure 3. Probabilities for different values of 6 plotted as a function of A'^ using T* niin = 0.1. (a): Probability that galaxies in a cell are 
bound Pbound- (b): Probability that a cell has negative specific heat Pneg- (c): Probability that a cell has negative specific heat given that 

it is bound Pncg|bound- 



ters that are widely separated as point masses when com- 
puting their interaction potential. For subclusters that 

are touching, fi{A^''^^) w 0.5 because |x^ ' — x^- | is on 
average the radius of a subcluster. Subclusters that are 
closer to each other may have a smaller value of K(r^^'^^), 
but such pairs may be merging, in which case we can 
treat such pairs as single subcluste rs with a different in- 
ternal structure ( Yang et"ar||2011 ). 



This means that the many-body problem of studying 
all the galaxies in a cluster is reduced to the few-body 
problem of studying the positions and velocities of the 
subclusters in the cluster. This makes the problem con- 
siderably easier, since there are fewer "particles" to deal 
with. Because clusters with more than 10 particles are 
likely to be virialized, we consider the case where cells 
have less than 10 subclusters. These cells have a non- 
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negligible probability of having a positive specific heat. 

Although subclusters may have different masses and 
different internal structures, there are analyses that take 
into account these more general cases. The analysis for 
particles of differ ent internal structure is described in 
Yang et al. (2011 1, and as suggested by equation (52 ) en- 
ters as a coemclent to /3. The analy sis for multiple masse s 



is considerably more complicated (Ahmad et al 



2006ah 



so to simplify it we make the reasonable approximation 
that "particles" have the same mass. In Appendix [A| we 
show that the masses of individual particles are typically 
within an order of magnitude of each other. 

4.2. The Detailed Configuration of a Cell 

The detailed configuration of a cell is directly related 
to its energy. We consider its potential and kinetic en- 
ergies separately, and relate them to W* and T*. We 
work with the instantaneous values of the energies since 
these quantities are well-defined and can be determined 
in principle by taking a snapshot of a cluster at a given 
time. In this section, we refer to the basic object in a 
cell as a "particle" , bearing in mind that it may describe 
individual galaxies or virialized subclusters. 

4.2.1. Potential Energy 

The potential energy in the cell can be separated into 
a local and background component such that 



$ = $ 



local 



$ 



background 



(56) 



where $iocai is the potential energy that arises from pairs 
where both members of the pair are within the cell and 
'^background IS the potential energy that results from pairs 
that have one member in the cell and the other member 
outside the cell. 

The local potential energy in the cell comes from sum- 
ming up the potential energy of all pairs within the cell 
such that 



$ 



Gm^ 



local 



l<i<j<N 



K{r,j) 



(57) 



where G is the gravitational constant, m is the mass of a 
galaxy, r^- is the distance between particle i and particle 
j and the sum is taken over all possible pairs of galaxies 
within the cell. Here K{r) describes the potential of par- 
ticles that are extended sources and is a modification to 
the potential used to describe point masses. It is related 
to C(e/i?i) through equation ([6]) and is usually of order 
unity for particles that are small compared to the cell 
size. 

We can write the local potential in terms of the average 
inverse separation of galaxy pairs so that 



$ 



local 



-Gm 



,N{N-l) /K{r) 



(58) 



where the N{N — l)/2 factor is the number of unique 
pairs in a cell with N particles. Here (l/r) is the average 
inverse pairwise separation of all galaxy pairs in the cell. 
In an expanding universe, the expansion of the uni- 
verse exactly cancels the smoothed background term in 
the potential ( Saslaw fc Fang|[l996 ) so we can ignore the 
background term for cells that are larger than the scale 



at which the two-point correlation function ^2 is negli- 
gible. At these scales, the potential energy is extensive 
since the expansion of the univer se cancels out th e back- 
ground term. For smaller cells, Saslaw & Fang ( 1996 1 
suggest that extensivity is also a good approxirnation 
in the regime where ^2 ^ 1 so that the correlation en- 
ergy within a cell is much greater than the correlation 
energy between cells. This approximation holds because 
the form of the partition function is independent of scale. 
The scale dependence is provided by the clustering pa- 
rameter h which is r elated to the two-poi nt correlation 
function ^2 through ( Saslaw fc Fang|[T996 ) 



b = 



2TrGm^ 
3T~ 



S^2{r)rdr. 



(59) 



For these reasons, the potential energy in a cell is ap- 
proximately extensive regardless of its radius, and the 
local potential $iocai is a good approximation for the cor- 
relation potential energy. This also lets us use R instead 
of i?i in the scaling factors A and a 

For an individual cell, the potential energy, W , is thus 



H/ = $ 



local 



-Gm- 



,N{N^\) 



nir) 
r 



(60) 



and using equation (pi) , we can write the scaled potential 
energy as 



W. 



2Gm^ N{N -1) / K{r) 
' iNA 2 



4 {N~l)^/K{r) 



9NC{e/R) \ r 



(61) 



where we have used the scaling factor A given by equa- 
tion (|4| because we are dealing with the detailed config- 
uration of the galaxies in the cell. Here, the Gni^ factors 
cancel and W^, is determined only by N, R and {K{r)/r). 
For a homogeneous cell, the average inverse separation 
of all pairs of particles scaled to the cell radius is 



R 



honi 

We can then write 

N 



R 

N 



^1. 2j 3 

47rr dr — -. 

V r 2 



N 



I ^/ n{r) 



R 



ru ^^^2^ 

horn. 



(62) 



(63) 



where 77 is a form factor that describes the shape of a 
cluster of galaxies through its configuration and the size 
and mass of each particle's individual halo in comparison 
to a homogeneous cell. This gives 



W, = 



v 



where 



4 1 
3iV2 



3 a^iR) 

Tij/R' 



E 



(64) 



(65) 



l<i<j<N 



This is defined only for cells with N > 2 because rj is not 
meaningful where A^ = or A'^ = 1 since there are no 
other particles in the cell to form a cluster. 
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4.2.2. Kinetic Energy 
The kinetic energy of a cell is defined as 



K = 



V^ 1 ., 1 , n 



^^ 2 



"^««^ = 2^"^ ^^*^^ 



(66) 



This depends on v"^ which depends on the units used to 
describe time. A suitable choice for the time unit is a 
representative dynamical time for the cluster, given by 



Tdyn = (Gp) 



-1/2 _ 



fl3 

GmN' 



(67) 



For V given in terms of the dynamical time, the kinetic 
energy is therefore 






(68) 



and from equation Q we get 
2K m{v^) 4 



T* 



1 



SNA 3A 9 R^C,{elR) 
We can write the velocity as 
1 



{{v^T^ynf). (69) 



i?2 



((w«Tdy„)^) 



(70) 



so that v^ is the mean-squared velocity, given in units of 
cell radii per dynamical time. In terms of v, we get 



T, 



dCie/R)- 



(71) 



4.2.3. 



Total Energy and Virial Ratio 

The scaled ei ierg y of the cell is given by adding equa- 
tions (feil) and fnT to get 



E^=T^+W^ = 



9 a^/R) 



2 V 

3 C(e/i?) ■ 



(72) 



Likewise, the virial ratio ?/; is given by the ratio between 
W^ and T* such that 



^ = 



2n 



III 

4y2- 



(73) 



We next compare these quantities to the constraints 
on Et, , T* and W^ given in section [2J The first two 
constraints are T« > and W^, < which come from 
the definition of the kinetic and potential energies. The 
other cons trai nts come from quasi-equilibrium following 
equations (13) and (14 1 such that < -0 < Ij 



ly* > 



-2^/3 
3 



-1.058 



and 



E^ > 



{VTO - 4) (\/lO - 3) 
10-2 



1/3 



-0.393. 



(74) 



(75) 



The first two constraints are hard limits that also apply 
to the instantaneous values so that T* > and M^* < 0. 
The next two constraints given in equations ( 74 ) and ( 75 1 
only apply to cells in quasi-equilibrium, but suggest that 
the values of W* and T* are not likely to deviate very far 



from this range since such cells will quickly relax into a 
quasi-equilibrium state. This places limits on r/ and v. 
The constraint given by the limit on W^, in equation 



( 74 ) suggests that for a cell to be in quasi-equilibrium, 
galaxies cannot be too close to each other because the av- 
erage pairwise inverse separation (1/r) would become too 
large. This is closely related to the constraint that ip < 1 
which comes from the fact that cells where —W> 2K are 
rapidly collapsing and hence not in quasi-equilibrium or 
virial equilibrium. Particles in such cells are likely to be 
close to each other. These particles will experience close 
encounters resulting in mergers or strong accelerations 
that substantially change their velocities in a short time. 
In both cases, the phase space configurations of such cells 
are highly unstable. This provides a strong constraint on 
the spatial configuration of a cell, and therefore we ex- 
pect few cells with arbitrarily dense cores unless they are 
already relaxed in virial equilibrium. 

4.2.4. Subclusters and mergers 

Galaxies and clusters are known to merge when they 
get close. Gravitational effects such as dynamical fric- 
tion, tidal forces and slingshot ejection transfer energy 
from their orbits into the internal energies of their parti- 
cles and their orbits merge as their constituent stars or 
galaxies relax into a single entity. 

Ear ly studies of merging galaxies (e.g. 



1979 Aarseth & Fall 



1980 



Garcia-Gomez et al. 



Roos & Norman 



suggest that the criteria tor galaxies to merge depend on 
the masses and radii of the progenitor galaxies, and their 
relative velocities, sp i ns, an d separations. For example, 
Garcia-Gomez et al. (19961 obtained a merger criterion 
from empirical fits to W-body simulations giving 



(mi +m2)rp 
2.5(miei +^2^2) 



virp) 



1.18fe(rp) 



< 1 



(76) 



where mi and 7712 are the respective galaxy masses, ei 
and 62 are their half-mass radii, Vp is the minimum sep- 
aration between the galaxies, and v{rp) and Ve{rp) are 
their relative velocity and escape veloci ty at minimum 
separation. A similar merger criterion by [Aarseth fc Fall 
(1980) is 



2.6(ei 



^2) 



v{rp 



1.16Ue(''p 



< 1. 



(77) 



These parameters indicate that galaxy and cluster merg- 
ers are complicated events that depend on both the in- 
ternal and external dynamics of the merging clusters. 
However, the terms involving rp in the merger criteria 
suggest that the separation between galaxies cannot be 
arbitrarily small or a merger will occur regardless of their 
mutual velocities. Qualitatively, this means that there is 
a minimum separation between pairs of galaxies or clus- 
ters, and this minimum separation is probably closely 
related to the masses and radii of both galaxies in the 
merging pair. 

The different merger criteria suggest that there is no 
simple dependence on the minimum distance between 
galaxies. Indeed, binary galaxies merge because gravi- 
tational effects such as dynamical friction and tidal in- 
teractions transfer orbital energy in a tight binary galaxy 
into the internal kinetic energy of their constituent stars. 



13 



The efficiency of this process is known to depend on both 
the dynamics of the encounter and the internal rotation 
of the merger compon ents (e.g. Toomre fc Toomre|1972 



D'Onghia et al.|2010 1 and hence a wide variety ot condi 



tions may result m a merger. For this reason, we do not 
quantify the minimum distance between galaxies or sub- 
clusters, but note that they are not collisionless and will 
merge if they get too close to each other. Hence there 
will be a minimum distance between "particles" in a cell. 
Even though merging requires time, merging subclus- 
ters, being denser than the entire cluster, will have a 
shorter dynamical time r^yn than the entire cluster. This 
means that the merging subcluster can relax to become 
a single subcluster faster than the overall cluster can re- 
lax. Therefore, a stable cluster configuration cannot have 
more than about 10 subclusters, as we have shown ear- 
lier, and these subclusters must be sufficiently spread out 
that they are unlikely to be merging. 

5. COMPARING DIFFERENT SHAPES 

From the previous sections, we can narrow the proper- 
ties of a quasi-equilibrium cluster. Such clusters can be 
decomposed into subclusters whose masses are within an 
order of magnitude of each other. Clusters that have a 
positive specific heat are likely to have less than 10 sub- 
clusters, and these subclusters are spread out through- 
out the cluster. Clusters that have a negative specific 
heat are more likely to occur than clusters with a pos- 
itive specific heat, and they are nearly virialized in a 
mostly closed system. These virialized clusters will have 
a roughly spherical or ellipsoidal shape, unlike clusters 
with a positive specific heat which may be more irregu- 
lar. 

The small number of "particles" in a cell suggest that 
an effective description of a cell and hence of a cluster of 
galaxies is the number and detailed positions of its sub- 
clusters. Using this description, compact clusters have 
a single dominant subcluster, medium compact clusters 
have multiple subclusters and loose clusters have a single 
diffuse subcluster that is most likely to be unvirialized. 

Based on this classification, compact clusters are best 
represented as spherical with a density profile. Loose 
unvirialized clusters are likely to be rare, and have fewer 
galaxies. Such clusters are better represented as a col- 
lection of galaxies. Finally, medium compact clusters, 
having multiple pronounced concentrations, are best de- 
scribed as a collection of nearly virialized subclusters. 
With this in mind, we can decompose a rich cluster into a 
small number of nearly virialized subclusters. These sub- 
clusters are simple, nearly spherical objects with density 
profiles which are easily characterized. Sparser regions of 
space may be similarly decomposed although they may 
have different scales. 

5.1. Probability as a function of virial ratio 

From the detailed positions and velocities of the sub- 
clusters in a cell, we can calculate the instantaneous en- 
ergy and virial ratio of the cell. These energies repre- 
sent a snapshot of a cell that is more closely related to 
its local dynamics than to its average thermodynamics. 
The energies fluctuate about the quasi-equilibrium en- 
semble averages as particles move about the cell. There- 
fore a measured instantaneous virial ratio corresponds to 



a range of quasi-equilibrium virial ratios. These fluctu- 
ations have been measured to be up to 20% in TV-body 
simulations (Aarseth & Saslaw 1972), so we use a range 
of 20% about the measured virial ratio. 
This gives us a range over which to integrate equation 



( 45 ) , so that the probability that a cell has a measured 
virial ratio ip is 



.1.2V _ _ 

PnW = / fv{N)P{i:\N)d^. 



(78) 



This probability, although related to the configuration 
of a cell, is not the probability that a cell has a similar 
configuration. This is because different conflgurations 
may have the same energy and virial ratio. In fact, be- 
cause galaxies in a cell move about, the detailed spatial 
conflguration of a cell is not static. For this reason, we 
assume that all configurations with the same virial ratio 
are equally probable because they are likely to be chance 
occurrences as galaxies and subclusters move around a 
cell. Under this assumption, we can compare different 
conflgurations and study how much more likely a cell is 
to have one configuration rather than another. 

For a cell in which we can measure both T* and W^ , the 
measured virial ratio tp is easily computed. However, for 
a cell in which we only have the spatial configuration (e.g. 
observations with no radial velocity information), we 
need to make additional assumptions in order to infer a 
value of T* . Since cells that are not in quasi-equilibrium 
are likely to relax into quasi-equilibrium rapidly, we may 
assume that the cell is in quasi-equilibrium to get the 
most prob able value of T*. This means that we can use 
equations ( 14 ) and ( [T8| to estimate T^ and f/; from an 
observed value of Wl. Ihus 



W, = 



2T, 



7-3 



and 



■0e 



W, 



2T, 



-'*,cst. + 1 



(79) 
(80) 



from which we can use equation ( 78 ) to calculate a prob- 
ability. 

Another application of the range of probabilities is to 
determine if a specific configuration is likely to be bound, 
unbound or virialized. This is useful for many cases. 
However, in the case 0.86 < ip < 0.999 a cluster has a 
negative specific heat because most spatial conflgurations 
have a value of "0 that falls within the negative speciflc 
heat branch of E^, [4>] in figure 111 Then the variation in i/j 
means that we integrate over the entire negative specific 
heat branch. This suggests that when a cell is virialized, 
its spatial configuration can take almost any shape. 

5.2. Illustrative cases 

To illustrate this procedure, we compare the two highly 
idealized configurations of a line and a ring where we fo- 
cus on the spatial conflguration. To reduce the number of 
independent variables, we i gno re the kinetic energy infor- 
mation and use equations ( 79 1 and ( 80 ) to estimate the 
virial ratio. In these conflgurations, particles are regu- 
larly spaced so it is easy to compute the potential energy 
by a simple summation. The cases are different since the 
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Figure 4. Value of Wt, for the line and ring configuration. The 
shaded area indicates the region where W* is too nega tive fo r the 
cluster to be in quasi-equilibrium as discussed in section [4.2.3| The 
value of Wf for both configurations becomes negatively infinite as 

N -^ oo. 



line configuration is a l-dimensional configuration, and 
the ring is a flat 2-diniensional configuration. 

5.2.1. Line configuration 

The simplest l-dimensional configuration is a line with 
particles spaced at regular intervals. For a cell with N 
particles and radius R, the spacing between each adja- 
cent pair of particles is 2R/{N— 1). This configuration is 
infinitesimally thin, and hence the axis ratios are infinite. 

Because particles are arranged in a line, particle i has 
i — 1 particles to its left and N — i particles to its right. 
Therefore W^ obtained from equation ( 64 ) is 
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(81) 



5.2.2. Ring configuration 

A simple case for a 2-dimensional configuration is 
where particles are arranged in a ring about their com- 
mon center-of-mass. The particles are spaced evenly 
around the circumference of a circle so that the distance 
between particles i and j is 2i?sin(7r(i — j)/N). 

From the symmetry of the ring, the total potential en- 
ergy on any particle in the cluster is the same for all 
particles. Therefore W^, for the ring configuration is 



W, 
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(82) 



To compare these cases, we plot W^. against N in figure 
|4] which shows that that a line configuration cannot be in 
quasi-equilibrium when iV > 18. This suggests that fila- 
ments cannot be arbitrarily long because filaments that 



are too long are unstable and would quickly collapse into 
a different configuration. A similar case can be made for 
the ring configuration, such that the ring cannot be in 
quasi-equilibrium when N > 1563 but a cell with such 
a large value of N would have to be very dense or very 
large. 

Very dense cells are unlikely to contain a ring because 
in a very short dynamical timescale r^yn ~ {Gp)~^'^ 
they would have relaxed into approximate virial equilib- 
rium destroying any quasi-equilibrium ring structure. In 
addition, a large ring containing ^ 1500 galaxies is more 
likely to be primordial than to have been formed through 
quasi-equilibrium clustering. 

Although the ring and line configurations are highly 
idealized cases, the difference between them indicates 
that changing the axis ratio of a configuration has 
an effect on W^,. In particular, a 2-dimensional con- 
figuration allows for more distance between particles 
than a l-dimensional case, and hence a larger nearest- 
neighbor distance. This makes W^. less negative for the 
2-dimensional case than the l-dimensional cases since Wt. 
depends on the average inverse separation between par- 
ticles. 

This dependence on dimensionality illustrates how 
shape influences the potential energy of a cluster. For 
example, in the simplest case of a uniform cluster, a 
spheroidal cluster is likely to have a more negative value 
of W^ than a spherical cluster of the same mass and ma- 
jor axis. However, there are many shapes that have the 
same energy although it is possible to calculate the po- 
tential energy from the shape and distribution of galaxies 
in a cluster. Thus the internal structure of a cluster plays 
an important role in characterizing the potential energy 
of a cluster and a simple description of shape is likely to 
be insufficient to describe a cluster. Therefore we use the 
detailed positions of the galaxies or subclusters instead. 

To compare the line and the ring configuration, we 
calculate a n e stimat e o f tp for both configurations using 
equations ( 79 1 and ( [SO] ) where W^, is given by equations 
(81) and ^M for thFline and rin g re spectively. Using 
these values of "0, we use equation (78) to determine the 



probability that a cell has a similar virial ratio which is a 
necessary condition to have a specific configuration. To 
compare these two shapes, we take the ratio of the prob- 
abilities PAr(V'linc)/^Af(''/'ring) which tcUs US hoW mUCh 

more likely a cell is to contain a line than a ring. This 
ratio is independent of iV because the /v(iV) term can- 
cels out in the ratio. We plot PAr(V'iinc)/^Ar(V-'ring) for 
different regimes of V' in figure [5] 

Here, we do not plot the comparison for the negative 
specific heat branch because it covers a very small range 
of ip such that any configuration with a value of V' in 
the negative specific heat branch will include the entire 
negative specific heat branch because of the roughly 20% 
statistical variations over time of ^ in equation ( 78 ) . In 
addition, VF*['0] is double valued with a minimum at 
ip = 2/3, so for almost every value of W^ there is a 
value of tp in both the positive specific heat and neg- 
ative specific heat branches. This means that almost 
any spatial configuration is equally likely in the negative 
specific heat branch because there always exists a value 
of tp such that the integral in equation ( 78 ) covers the 
entire negative specific heat branch. The only spatial 
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Figure 5. Relative probability that a cell has a line configuration rather than a ring configuration for different conditions. Top left: Cells 
in quasi-cquilibrium. Top right: Cells with a positive specific heat. Bottom left; Unbound cells. Bottom right: Bound cells with a positive 
specific heat. 



configurations that are unlikely to occur in the negative 
specific heat branch are those with very negative values 
of M^*. Since the absolute value of W* depends on the 
inverse separation between particles, such configurations 
will have particles that are very close to each other, and 
therefore arc likely to be unstable to mergers. 

From figure [5l in many cases the line configuration is 
more probable than the ring configuration, and it is only 
in the case of small h or large iV that a ring configuration 
is more probable than a line configuration. This means 
that in most of the universe, we are more likely to find 
galaxies arranged in a line than galaxies arranged in a 



ring. This prediction is consistent with the observation 
that filaments are more frequently observed than galaxies 
or clusters arranged in a ring. 

6. CONCLUSIONS AND DISCUSSION 

In this paper we have described a theory of clustering 
that extends the work of Leong & Saslaw ( 2004 ) to calcu- 
late the probability that a cell has a given time-averaged 
virial ratio ■0. This probability is independent of the av- 
erage number of galaxies in a cell and depends on the 
clustering parameter 6, the number of galaxies TV in the 
cell and the range of virial ratios that the cell may have. 
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A feature of this theory is the use of subclusters to de- 
scribe the internal spatial configuration of a cell. These 
subclusters are concentrations of galaxies that are nearly 
virialized and share a common bulk velocity. The use of 
subclusters considerably simplifies the analysis because 
their masses are approximately equal and most clusters 
that are still assembling will have less than 10 subclus- 
ters. This is because cells with more than 10 subclusters 
are likely to already be virialized, and subclusters that 
are less massive than a tenth of the most massive sub- 
cluster do not have enough mass to significantly affect 
the potential energy. 

Because subclusters are collections of galaxies, the 
number of subclusters within a single cluster can be an 
indicator of its evolutionary stage. In hierarchical mod- 
els, clusters build up by the merger of multiple subclus- 
ters. Thus the presence of multiple clusters within a cell, 
or significant substructure within a cluster would suggest 
that mergers were relatively recent. Such clusters, hav- 
ing multiple subclusters, are classified as medium com 
pact c lusters in the classification scheme of 
( 1957). More evolved relaxed clusters have 



Herzog et al. 

ess substruc- 

ture with a single dominant concentration. Such clusters 
are their own subcluster and are classified as compact 
clusters under the same classification scheme. 

In addition, clusters with a negative specific heat 
may take on almost any configuration because W^, [ip] is 
double-valued with a minimum at ■0 — 2/3. Hence for 
almost every value of W^, there is a value of ■0 in the 
negative specific heat branch. The only spatial config- 
urations that are unlikely to occur in the negative spe- 
cific heat branch are those with very negative values of 
W^ . Since the absolute value of W* depends on the in- 
verse separation between particles, such configurations 
will have particles that are very close to each other, and 
therefore are likely to be unstable to mergers. Together 
with the roughly 20% statistical variation over time of -0 
about ip, any configuration with a value of "0 in the neg- 
ative specific heat branch may be a chance configuration 
of a cluster that has a negative specific heat. 

Since the GA'^-dimensional phase space configuration 
of subclusters in a cell is directly related to the cell's 
potential and kinetic energies and thus its virial ratio, 
we can use the theory to study the internal structure 
and shapes within a cell. To do this, we relate the en- 
ergy calculated from the observed instantaneous phase 
space configuration to the ensemble average energies in 
quasi-equilibrium. This highlights the fact that clusters 
of galaxies are not in strict equilibrium and hence their 
potential and kinetic energies fluctuate about the ensem- 
ble average energy. The spatial configurations of clusters 
change as a result of these fluctuations. 

Because the average thermodynamic quantities of an 
ensemble of cells in quasi-equilibrium change very slowly 
compared to the dynamical timescale of a single cell, the 



quasi-equilibrium energies are approximately time aver- 
ages. We use the spectrum of fluctuations to determine 
the probability that a cell may have a particular instanta- 
neous virial ratio. This is a necessary but not sufficient 
condition for a cell to have a given configuration. To 
compare different configurations, we therefore compare 
the probability that a cell has a virial ratio which could 
give rise to a given configuration. 

Based on the range of fluctuation s in virial ratio ob- 
seryed in iV-body simulations by [Aarseth &: Saslaw| 
£19721), we conclude that cells that have a negative spe- 
cific heat may have almost any spatial configuration. 
These configurations are likely to be chance configura- 
tions of fluctuating cells that are nearly virialized. Cells 
with positive specific heat have a range of energies cor- 
responding to fewer spatial configurations. In our com- 
parisons between line and ring configurations, the prob- 
ability that a cell will have a line configuration rather 
than a ring configuration varies depending on the clus- 
tering parameter b and the number N of subclusters in 
the cell. Therefore we present the following procedure 
for comparing the shapes of two clusters: 

The first step is to define a region of space that covers 
the cluster. The size and shape of this region of space 
define the cell size. Within the cell, identify the subclus- 
ters whose masses are within an order of magnitude of 
the largest subcluster. The positions of these subclus- 
ters are directly related to the scaled potential energy 
W^. of the cluster. From the observed VF* or the peculiar 
velocity information, estimate the virial ratio 0. 

The next step is to use the cell size to calculate the 
mean and variance of the counts-in-cells distribution of 
similarly-sized subclusters for a larger sample to obtain 
the clustering parame ter b (e.g. Sivakoff fc Saslaw]|2005 



Yang fc Saslaw|[20ll ). Then using the observed param 



eters tp and b, calculate the probability of finding a cell 
with a similar virial ratio using equation ( 78 ) . This prob- 



ability should then be compared to a reference cell that 
has a different structure to determine the relative prob- 
abilities of different configurations. 

Although the configurations that we use here are highly 
idealized, our comparison of a line and a ring config- 
uration is generally consistent with observations of the 
cosmic web. This simple test of our theory gives a rea- 
sonable result. We intend to make more detailed com- 
parisons with observations and more realistic cases which 
may include dark matter in a forthcoming paper. 
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APPENDIX 



MULTIPLE MASS COMPONENTS 

To illustrate the effect of having multiple mass components in a cluster, we consider the case where a cluster has 
two populations of subclusters. For simplicity, we assume that these subclusters are point masses, and the masses of 
each subcluster are mi and m2 with 7712 > ttt-i for subclusters in each population. 

In such a cluster, we can consider the interactions of particles from each population such that the total potential of 
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the cluster is (c.f. Equation 50 1 
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where th e su perscripts denote members of the different populations, and x denotes position. The three terms in 
equation ( Al ) come f rom considering each mass component as a separate system, and adding their mutual interactions. 



Therefore, equation (Al) is similar in form to equation (50), even though we do not place any constraints on the 
positions of individual particles. 



We can take the average of the sums in equation ( Al ) such that 
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where A^i and A^2 are the numbers of particles in each population, and {1/r^^') is the average inverse separation 
between members of population 1. In the case where both populations are similarly distributed throughout the cell, 
the separate averages are approximately equal, i.e. (1/r) w (l/r'^^) w (l/r'^2)^ ~ (l/r'^^'2)) so only the mass and 

number ratios affect th e to tal potential; 

Therefore, equation (61) gives (c.f. Ahmad et al.||2006a ) 
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TO2 or N2 — 0. When the mass ratio 7112 /tri 



which reduces to the single mass case given by equation (61) for mi 

is sufficiently large that the N2{7n2/7ni) term is larger than the A^i term in the denominator, we can ignore the less 
massive component. In such cases, most of the total mass is in the more massive particles and these particles dominate 
the total potential energy of the cell. 

This means that in cases where most of the mass in a cell is concentrated in a single large subcluster, the large 
subcluster dominates the potential and all the other smaller subclusters are satellites around the large subcluster. In 
such cells, the theory we have described thus far does not apply because there are no other similar subclusters for the 
dominant subcluster to cluster with. 

For a cluster with less than 10 subclusters, a mass ratio of (m2/mi) ~ 10 is large enough such that A^2(w2/toi) > Ni. 
This means that we only need to consider subclusters that are at least a tenth as massive as the most massive subcluster 
in the cluster. These subclusters will contain most of the mass in the cluster, and less massive subclusters are likely to 
be satellites around these more massive subclusters. To illustrate this, we compare W*^2comp. with the single mass case 
for different mass ratios and A'^2 in figure [6] In the two population case, W^, is lower than in the single mass case. At 
7112 /m,i ^ 10, the difference may be larger than 20% depending on the number ratios. This means that particles that 
make a significant contribution to the total potential energy will have the same mass to within an order of magnitude. 
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Figure 6. W, 2comp/W, icomp for different total numbers of particles N, numbers of more massive particles N2 and mass ratio m^lrax. 
Left: TV = 5, right: N = 10. 



